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(54) Enhanced message centre 

(57) Communication Apparatus with Text-to- 
Speech Capability 

An enhanced message centre tor a telephone network 
comprises a connection (21) to the Intemet a connec- 
tion (22) to the PSTN, a PSTN interface unit (23) cou- 
pled to the connection (22) to the PSTN» a character 
recognition unit (24) coupled to receive an output of the 
PSTN interface unit (23). a language recognition unit 
(25) coupled to receive a text signal (26) and a writing 
system signal (27) from the character recognition unit 
(24) and e-mail text data from the Internet connection 



(21). a text-to-speech unit (28) coupled to receive a text 
signal (29) and a language signal (30) from the lan- 
guage recognition unit (25). and a storage unit (30) cou- 
pled to receive speech data from the text-to-speech unit 
(28) and an output of the PSTN interface unit (23). The 
storage unit (30) is provided with a link to a satellite 
mobile telephone system, illustrated by a SAN (4). a 
satellite (6) and a UT (7). A machine translation unit 
may also be provided. 
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Description 



I me^^l processing apparatus 
lessli^Pn a telephone system, 



The present invention relates to an enhanced mes- 
sage centre. 

Telephone systems suffer from the problem that a 
subscriber is not always available to anwer a call. One 
solution to this problem is the use of the now ubiquitous 
tel^hone answering machine. 

This, however, does not provide a solution in the 
case of mobile telephone systems where the unavaila- 
bility of a subscriber is often linked to the tack of an open 
channel betwem the nrkobile telephone network and a 
nrK>bile telephone. For instance, the mobile telephone 
may be switched off or may be located out of range of 
any of the network's base stations. One solution to this 
secorxJ problem is to effectively nxive the telephone 
answering machine from the sij6scrit>er's terminal into 
the networks control system. Thus, if a sut>scriber can- 
not be reached, the calling party can leave a voice mes- 
sage whk:h can be retrieved later by the called party. 
Such a system is not, of course, limited to mobile tele- 
phone systems in its application. 

It has also been proposed that e-mail messages be 
converted into speech for delivery to a subscriber using 
a simple telephone in a manner similar to the afore- 
mentioned voice messages. 

The present invention provides an extension of 
these message handling systems to include facsimile 
transmissions and introduces linguistic functions, e.g. 
autonr^tic or user selected translation of messages. 

According to a first aspect of the present invention, 
there is provkled a message processir^g apparatus for 
storing voice mail messages in a telephone system, the 
message processing apparatus comprising identifying 
means to identify an incoming signal as a facsimile 
transmission, means responsive to the identifying 
means to store the incoming signal, character recogni- 
tion means for converting writing represented by the 
incorreng signal into a text file, and text-to-speech 
processing nr>eans for converting the text file into 
speech data and storing the speech data as a voice mail 
message. 

Preferably, an apparatus according to the present 
invention includes translation means for translating the 
text in said text file from a first language to a second lan- 
guage. More preferably, means are provided whereby a 
sut>scriber can select a language to be the second lan- 
guage. Conveniently, means is provided to identify said 
first language on the basis of the incoming signal and/or 
the text file. The language identification may employ 
both image processing and linguistic analysis tech- 
niques. 

Convenientiy, an apparatus according to the 
present invention includes means for receiving e-mail, 
means for converting the received e-mail into a text file 
suitable for processing by the text-to-speech processing 
means. 

According to a second aspect of the present inven- 



tion, there is provided a i 
for storing voice mail me 
tiie message processing apparatus comprising transla- 
tion means for tianslating text from a first language to a 
5 second language. The apparatus may be adapted to 
translate e-nnail messages. 

Embodiments of the present invention will now t>e 
descril>ed, by way of example, with refererKe to the 
accompanying drawings, in which: 

10 

Figure 1 shows the general form of a communica- 
tions system according to the present invention; 
Figure 2 is a block diagram of a first embodiment of 
an enhanced message cerrtre according to tiie 
IS present invention; 

Rgure 3 is a block diagram of the PSTN interface 
unit of Figure 2; 

figure 4 is a block diagram of the character recog- 
nition unit of Rgure 2; 
20 Figure 5 is a block diagram of the language recog- 
nition unit of Figure 2; 

Rgure 6 is a block diagram of the text-to-speech 
unit of Figure 2; arKi 

Rgure 7 is a block diagram of a second embodi- 
es merrt of an enhanced message centre according to 
the present invention. 

Referring to Figure 1. a communications system 
includes the public switched telephone network (PSTN) 

30 1, which for present purposes may be considered to 
include kx)th fixed arxj cellular mobile telephones, the 
Internet 2 and a satellite mobile telephone system 3. 
The satellite mol>ile telephone system 3 includes a plu- 
rality of satellite access nodes (SAN) 4, a control centre 

35 5. a plurality of satellites 6 (one shown) and a plurality of 
mobile telephones (UT) 7 (one shown). The SANs 4 are 
interconnected by a high speed data link 8. The SANs 4 
are also connected to the corrtrol centre 5 and to tiie 
PSTN 1 by high speed data links 9. 10. The control cen- 

40 tre 5 is connected to the Internet 2. The SANs 4 com- 
municate with UTs 7 via tiie satellites 6. 

The control centre 5 includes a message centre 1 1 . 
The message centre 1 1 provides voice messaging and 
e-mail text-to-speech messaging as known in the state 

45 of tiie art. However, it also provides enhancements to 
these services. These enhancements will now be 
described. 

Referring to Rgure 2. the message centi-e 1 1 com- 
prises a connection 21 to tiie Internet, a connection 22 

so to tiie PSTN, a PSTN interface unit 23 coupled to the 
connection 22 to the PSTN, a character recognition unit 
24 coupled to receive an output of tiie PSTN interface 
unit 23. a language recognition unit 25 coupled to 
receive a text signal 26 and a writing system signal 27 

55 from the character recognition unit 24 and e-mail text 
data from the Internet connection 21, a text-to-speech 
unit 28 coupled to receive a text signal 29 and a lan- 
guage signal 30 from the language recognition unit 25, 



2 



BNeOOCtD: *i£P_gS4Ma6A1JL> 



EP 0 849 925 A1 



and a storage unrt 30 coLfpHRb receive speech data 
from the text-to-speech unit 28 and an output of ttie 
PSTN interface unit 23. The storage unit 30 is provided 
with a link to a satellite mobile telephone system, illus- 
trated by a SAN 4. a satellite 6 and a UT 7. 5 

When a UT cannot be reached, calls and e*mail are 
redirected to the message caitre 11. Calls from the 
PSTN 1 are applied to the PSTN interface unit 23 which 
determines whether a call comprises speech or facsim- 
ile data. If the call comprises speech, the received 10 
speech is output to the storage unit 30. K. on the other 
hand, the received signal is a facsimile transmission, 
the facsimile data is passed on to the character recogni- 
tion unit 24. The character recognition unit 24 analyses 
the facsimile data to determine whether it contains text, is 
In determining whether the facsimile signal contains 
text, the character recognition unit 24 applies various 
tenplates repres^rting different writing systems, e.g. 
Roman alphabet. Cyrillic alphabet, Chinese characters. 
Japanese (Kanji, Hiragana arxi Katakana) or Korean so 
(Hangul). Once, a writing system is identified, the char- 
acter recognition generates character data comprising 
codes for the characters recognised in the facsimile 
data signal. The cfiaracter data signal is output together 
with a data irKiicating the identified writing system 2s 

The character data signal and the writing system 
data signals are applied to language recognition unit 25. 
The language recognition unit 25. identifies the lan- 
guage of text represented by the character data signal 
with reference to the writing system signal. Once, the 30 
language has been identified, the language recognition 
unit outputs the character data signal and a language 
signal to the text-to-speech 28. 

The text-to-^eech unit 28 generates an encoded 
speech signal from the character data signal in accord- 35 
ance with the language signal. The encode speech sig- 
nal is output to the storage unit 30 where it is stored. 
The encoded speech signal is in the form used by the 
satellite mobile telephone system for ordinary transmis- 
sion of speech between subscribers. 40 

When a UT 7 registers with the satellite mobile tel- 
ephone networK the message centre 1 1 is notified arKl. 
if there are messages waiting from the UT 7. the mes- 
sage centre 11 calls the UT 7 to inform the user that 
there are messages waiting. If the user wishes to hear 45 
his messages, he responds by entering a predeter- 
mined key sequence which causes a command code to 
be serrt Ijack to the message centre 11. The message 
centre 11 then retrieves the encoded speech signals 
from the storage unit 30 and sends them to the UT 7. so 

Referring to Rgure 3, the PSTN interface unit 23 
comprises a microcomputer sub-system 40, a "ring 
detector** and "hooX' simulating circuit 41 for detecting 
iricoming calls and signalling that connection has been 
established, and a dual tone mufti-frequency (DTMF) 55 
decoder 42. The microconputer sub-system 40 com- 
prises a microprocessor 43, a random-access memory 
(RAM) 44, a read-only memory (ROM) 45, a local net- 



work interface arcuit 46. a digital-to-ar^afogue converter 
(OAC) 47, a fax-modem drcutt 48 and an ana!ogue-to- 
digital converter (ADC) 49. The components of the 
microcomputer sub-system 40 are interconnected by 
address and data bus 50. 

When an irxx>ming call is detected t3y the simulating 
circuit 41 . the simulating circuit 41 applies a signal to an 
interrupt port of the microprocessor 43. In response to 
this, the microprocessor 43 issues an 'off hook" com- 
mand to the simulating drcuit 41 which then signals 
back to the PSTN that the call has been oorviected. 
Then the microprocessor 41 reads speech data from 
the ROM 45 and sends it to the DAC 47. The DAC 47 
outputs an analogue speech signal to the line from the 
PSTN. The speech signal comprises the message "If 
you wish to leave a voice mail message, press gate(#)". 

If the calling party is makir^ a speech call, he will 
hear the message and either press the # key on his tel- 
ephone or hang up. The nticroprocessor 43 monitors 
the output of the DTMF decoder 42 for a short period, 
e.g. 5 sec. If a # signal is rec^ved in tfits period, the 
microprocessor 43 decides that a voice mail message is 
to be left and t>egins to read data from the ADC 48. The 
data read from the ADC 48 is encoded by tf>e micro- 
processor 43 to compress it and stored in the RAM 44. 
After 20 seconds, the mterocomputer 43 issues an "on 
hook" commarxj to the circuit 41 which signals to the 
PSTN that the connection has been terminated. The 
encoded speech data is read from the RAM 44 and sent 
via the network interface circuit 46 to the storage unit 30 
(Figure 2) for storage as conventfonal voice mail. 

If on the other hand, a # signal is not received^ the 
mk^roprocessor 43 instructs the fauc-mod^ circuit 49 to 
send fax negotiating signals to the PSTN. If a facsimile 
machine is making the call, it will resporxi and start to 
transmit facsimile signals. If no response is received l>y 
the fax-modem circuit 49 within 10 seconds, microproc- 
essor 43 sends the "on hook" command to the circuit 
41: 

Received fax signals are decoded by the fax- 
modem circuit 49 and read by the microprocessor 43 
which stores them in a TIFF file in the RAM 44. When all 
of the fax data has been received, microcomputer 43 
sends the "on hook" command to the circuit 41 and 
sends the TIFF file to a network storage devrce that can 
be accessed by the character recognition unit 24 (Rg- 
ure 2), via the network interface circuit 46. ^ 

Referring to Figure 4. the character recognition' unit 
24 comprises a converrtional microcomputer 50. The 
microcomputer 50 itself conprises a microprocessor 
51, a RAM 52. a ROM 53, a focal network interface' 54 
and an address and data bus 55, interconnection the 
microconputer 50. the RAM 51, the ROM 52 and ihe 
local network interface 54. 

When character recognition is to t>e earned out, the 
microcomputer 50 retrieves a TIFF file from the network 
storage device and stores it in the RAM 52. The writing 
system used fw the facsimile represented l>y the TIFF 
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file must first be determined. A suitabld^^od of writ- 
ing system identification is disclosed in "Script Determi- 
nation in Document lmages\ Spitz. A.L, SPIE. vol. 
2422. pp. 293-301. 

Once the writing system has been identified, the s 
microcomputer 50 processes the TIFF file according to 
the appropriate programme to produce a text file. The 
text file contains a header identifying the writing system 
used and codes representing the text in the facsimile. 
The text file is then sent to the network storage device io 
by the microcomputer 50 via the local network interface 
54. 

Referring to Rgure 5. the language recognition unit 
25 comprises a corwentional microcomputer 60. The 
microcomputer 60 itself comprises a microprocessor is 
61, a RAM 62, a ROM 63. a local network int^ce 64 
and an address and data Ixjs 65. interconnection the 
microcomputer 60. the RAM 61. the ROM 62 and the 
local network interface 64. 

When language recognition is to be carried out the 20 
microcomputer 60 retrieves the text file, stored by the 
character recognition unit 24, from the network storage 
device. The microcomputer 60 reads the text file's 
header. There are three possible relationships between 
the header arKl the language of the text In the first 2s 
case, the language Is inherently, uniquely identified by 
the writing system, e.g. Japanese or Korean. In the sec- 
ond case, a language cannot be identified by the writing 
system, even with reference to the text as is the case 
with Chinese characters. In this second case, a decision 30 
must be made to use a particular language, for Instance 
Cantonese or Marxiarin. as a default. In the third case, 
the language cannot be identified from the header 
alone. However, it can be identified from analysis of the 
text. Typical examples are the Roman and Cyrillic alpha- 35 
bets. 

In the first two cases, the microconrputer 60 merely 
restores the retrieved file in the network storage device. 
In the third case, the microconrputer 60 analyses the 
text using letter frequency taWes or by identifying char- 40 
acteristic characters or character groups to identify the 
language of the text. Once a decision regarding the lan- 
guage of the text has been made, the retrieved file is 
stored in the network storage device with a new header 
identifying a language. 4S 

The language recognition unit 25 also processes e- 
mail messages- E-nr>ail message files are retrieved from 
the network storage device. The microprocessor 60 
then analyses the text on the assumption that the writ- 
ing system used is the Roman alphat)et. Once a deci- so 
sion regarding the language has been made, the e-mail 
message is stored in the network storage device with a 
header indicating the language determined to l>e being 
used. 

Referring to Figure 6. the texl-to-speech unit 28 55 
comprises a conventional microcomputer 70. The 
microccMfnputer 70 itself conprises a microprocessor 
71, a RAM 72. a ROM 73. a local network interface 74 



5. interconnection the 
the ROM 72 and the 



925 A1 

and an address and data 
microconputer 70, the 
local network interface 74. 

When text-to-speech conversion is to be carried 
out, the microconputer 70 retrieves a file which has 
been stored on the network storage unit by the lan- 
guage recognition unit 25. The microconputer 70 then 
performs a text-to-speech operation using an appropri- 
ate program selected on the basis of the file's header. 
The speech data so produced is then coded to com- 
press it. The coded speech is then sent to the storage 
unit 30 for later transmission to a UT 7. 

In the foregoing description, the language of a mes- 
sage is determined solely from the characteristics of the 
message Itself. However, additional information such as 
the geographical source of a message may be used to 
assist in Identifying a language. 

A second embodiment of the present inverrtion pro- 
vides a machine translation function and will now be 
described. 

Referring to Rgure 7, the second embodiment of an 
enhanced message centre according to the present 
invention is sut>stantially the same as that shown in Fig- 
ure 1 . save for tiie additfon of a translation unit 80. 

The translation unit 80 retrieves text files from the 
network storage device and, if necessary, translates the 
text Into a language selected by the addressee. The 
preferred language is selected from a menu displayed 
on the UT 7 during a messaging set up procedure. Once 
the translation unit 80 has performed the necessary 
processing, the resultant text 81 and language identifi- 
cation data 81 are stored In a file in the network storage 
device for retrieval by the texl-to-speech unit 28. 

Since incoming fecsimile messages all pass 
through a text phase, the filtering arrangements availa- 
ble in e-mail systems may be enployed. 

I will be appreciated tfiat the translation facility of 
the second embodiment may be applied to an e-mail 
only system, e.g. a system as shown in Figure 7 but 
lacking the PSTN interface unit 23 and the character 
recognition unit 24. 

It will be appreciated that the present invention may 
be embodied using a single microconputer to perform 
the functions of tiie PSTN Interface unit 23, the charac- 
ter recognition unit 24. the language recognition unit 25, 
the text-to-speech unit 28 and, if used, the translation 
unit 80. 

Claims 

1. A message processing apparatus for storing voice 
mail messages in a telephone system, the mes- 
sage processing apparatus comprising identifying 
means to identify an incoming signal as a facsimile 
transmission, means responsive to the identifying 
means to store the incoming signal, character rec- 
ognition means for converting writing represented 
by the incoming signal into a text file, and text-to- 
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file into speech data and storing the speech data as 
a voice mail message. 

2. A message processing systems according to dairn s 
1. including translation means for trcuislating the 
text in said text file from a first language to a second 
language. 

3. An apparatus according to claim 2, including means io 
whereby a sul^scriber can select a language to be 

the second language. 

4. An apparatus according to claim 2 or 3. includir^ 
means to identify said first language on the basis of is 
the incoming signal ch- said text file. 

5. An apparatus according to any preceding claim, 
including means for receiving e-mail, means for 
converting the received e-nr^l into a text file suita- so 
bte for processing by the text-to-speech processing 
means. 

6. A message processing apparatus for storing voice 
mail messages in a telephone system, the mes- 2s 
sage processing apparatus comprising translation 
means for translating text from a first language to a 
second language. 

7. An apparatus according to claim 6. including means 3o 
to receive e-mail, wherein the translation means is 
operable to translate a received e-mail message. 
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